ADA  0395  6 6 


REA!/  INSTRUCTION F 
■ORE  COMPORTING  E< 


REPORT  DOCl  •AENTATlOt 


ATit  k£-t  easL^nkiiUaX 


Synchronization  and  Computing  Capabilities 
of  Linear  Asynchronous  Structures  , 


Technical 


6.  performing  org.  report  mum 


6.  CONTRACT  OR  GRANT  NUMBER("«J 


/ g)  Richard  J./Lipton,  1 
Raymond  E. /Miller  / 

Lawrence/  Snyder  / 

9.  PERTBfc'MiNG  ORGANIZATION  NAME  AND  ADDRESS 

Yale  University 

Department  of  Computer  Science 


N0C014-75-C- 

/VS'jT-'TV? 


10  Hillhouse  Ave 


controlling  office  name  and  address 
Office  of  Uaval  Research 
Information  Systems  Program 
Arlington,  Virginia  22217 


M.  MONITORING  AGENCY  NAME  6 ADDRESS^//  dilterent  trom  Controlling  Office) 


DEC  L ASSi  F|  CATION.  DOWNGRADING 
SCHEDULE. 


16-  DISTRIBUTION  STATEMENT  (ol  thin  Report) 


Distribution  of  this  report  is  unlimited 


17.  DISTRIBUTION'  STATEMENT  (ot  the  abstract  entered  in  Rlock  20,  It  dilterent  trom  Kei 


18.  SUPPLEMENTARY  NOTES 


IS.  KEY  WORDS  (Continue  on  tevorae  aide  it  necessary  arid  identity  by  block  number ) 

parallel  systems  cellular 

linear  asynchronous  structures 
firing  squad  synchronization  problem 
Church  Rosser  Theorem 


S.  JtF*S TT<  A C T ( Continue  on  teverae  s'de  If  necessary  and  Identity  by  block  number) 


in  which  questions  concerning  delay  bounded  asynchrono 

ynchronization  pro- 
i,',>cannot  be  solved 
Three  conditions  called  persistence, 
e conditions  are  shown  to 
execution  policy  can  be  re- 
icy  with  no  change  to  the  result  of  the 
>e  theorem 


parallel  systems  may  be  investigated,  it  is  shoen  that  sy;. 
blems , similar  to  the  firing  squad  synchronization  problem 
by  delay  bounded  asynchronous  systems 
determinacy,  and  single  change  are  introduced.  The; 
be  sufficient  to  guarantee  Lliat  a synchronou; 
laxed  to  an  asynchronous  execution  pol 
computation.  This  is  a Church-Rosser  tv 


but  in  addition,  the 


EDITION  OF  1 NOV  £.9  IS  OBSOLETE 


asynchronous  execution  time  is  shown  to  be  only  (D+l)  times  the 
synchronous  execution  time,  where  D is  the  delay  bound.  Finally, 
a wide  class  of  recognition  problems  is  identified  which  can  be  solved 
by  linear  asynchronous  structures. 


RC  5857 
(If  25330) 
2/11/76 
Computer 
Science 

40  pages 


SYNCHRONIZATION  AND  COMPUTING  CAPABILITIES  OF 
LINEAR  ASYNCHRONOUS  STRUCTURES^ 

* 

R.  J.  Lipton 

Department  of  Computer  Science 
Yale  University 
10  Hillhouse  Avenue 
New  Haven,  Connecticut  06520 

R.  E.  Miller 

Mathematical  Sciences  Department 
IBM  Thomas  J.  Watson  Research  Center 
P.  0.  Box  218 

Yorktown  Heights,  New  York  10598 
** 

L.  Snyder 

Department  of  Computer  Science 
Yale  University 
10  Hillhouse  Avenue 
New  Haven,  Connecticut  06520 

ABSTRACT:  A model  is  defined  in  which  questions  concerning 
delay  bounded  asynchronous  parallel  svstems  may  be  investi- 
gated. It  is  shown  that  synchronization  problems,  similar  to 
the  "firing  squad  synchronization  problem,"  cannot  be  solved 
by  delay  bounded  asynchronous  systems.  Three  conditions  called 
persistence,  determinacy,  and  single  change  are  introduced. 
These  conditions  are  shown  to  be  sufficient  to  g lar'rntee  that 
a synchronous  execution  policy  can  be  relaxed  tc  an  asynchro- 
nous execution  policy  with  no  change  to  the  result  of  the 
computation.  This  is  a Church-Rosser  type  theorem,  but  in 
addition,  the  asynchronous  execution  time  is  shown  to  be  only 
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identified  which  can  be  solved  by  linear  asynchronous 
structures . 

A preliminary  version  of  this  paper  was  presented  at  the 
IEEE  Symposium  on  Foundations  of  Computer  Science, 
Berkeley,  California,  October  1975. 

* 

** 

Supported  in  part  by  Office  of  Naval  Research  under  grant 
N00014-75-C-0752 . 

* 

Also,  supported  in  part  under  NSF  grant  DCR-74-12870,  and 
part  of  this  work  was  done  while  visiting  IBM  Research 
Center . 


I.IMITKI)  DISTRIBUTION  NOTICE 


I hi-  report  liar-  been  submitted  fur  pn lilii  ation  elsewhere  and 
has  been  issued  as  a Research  Report  fur  early  dissemination 
of  its  contents.  As  a courtesy  to  the  intended  publisher,  it 
should  not  he  widely  distributed  until  after  the  date  of  outside 
publication. 


Copies  may  be  requested  from: 

IBM  Ihomas  J.  W atson  Research  Center 

Post  < tffirr  Box  JIB 

Mirk  town  Heights.  New  \ ork  10598 


1 


1.  Introduction 

Computational  systems,  whether  they  be  hardware  or  software,  are 
usually  envisioned  as  an  interconnection  of  a number  of  separate  and  distinct 
processes.  Each  of  the  processes  is  assumed  to  perform  a particular  task, 
obtaining  inputs  from  other  processes  in  the  system  and  providing  results 
to  other  processes  in  the  system.  The  function  of  the  whole  system  is 
accomplished  through  the  combined  effort  of  the  distinct  processes  acting 
in  concert.  A specification  of  the  overall  control  of  when  processes 
are  to  act  and  communicate  with  each  other  is  usually  required  to  insure 
proper  operation  of  the  system.  In  programs  this  is  usually  done  by  specifying 
the  "flow  of  control"  of  the  program,  whereas  in  hardware  this  is  usually 
done  by  having  a centralized  control  unit  which  emits  control  signals 
to  the  processes.  As  is  well  known,  efficiencies  can  often  be  realized 
bv  having  several  processes  act  simultaneously,  or  in  parallel,  rather 
than  having  a single  sequence  of  process  actions.  Such  parallel  computation, 
however,  is  often  quite  complex  to  control,  especially  when  the  time  of 
process  performance  is  variable. 

In  this  paper  we  study  the  intercommunication  problems  for  systems 
of  interconnected  processes,  acting  in  parallel,  where  the  time  required 
for  a process  to  act  is  not  known  exactly.  As  a simplifying  assumption 
we  restrict  our  attention  here  to  linear  interconnection  of  processes. 

The  results  we  obtain  are  then  directly  applicable  to  such  "linear  structures." 
Also  this  provides  some  information  about  systems  having  more  complex 
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interconnection  since  in  any  such  system  there  are  linear  chains  of  inter- 
connected processes.  The  linearity  assumption  allows  us  to  draw  on,  and 


compare  our  results  with,  the  extensive  work  done  on  synchronous  linear 
structures.  For  example,  the  "Firing  Squad  Synchronization  Problem"  [7] 
is  one  of  the  earliest  questions  studied  in  this  context.  Since  that  time, 
a large  quantity  of  literature  has  appeared  on  cellular  machines,  iterative 
arrays,  parallel  grammars,  L-systems,  etc.  [2,4,9,10].  All  of  these  studies 
assume  synchronous  computation.  That  is,  at  each  discrete  moment  in  time, 
if  a machine  can  perform  some  transition  (or,  in  the  grammatical  case,  if  a 
production  applies)  then  that  transition  must  be  performed.  The  consequences 
of  relaxing  the  synchronous  requirement  to  asynchronous  operation  are: 
first,  that  some  tasks  which  can  be  done  synchronously  cannot  even  be 
approximated  asynchronously,  and  second,  for  those  synchronous  computations 
that  can  be  realized  asynchronously,  the  previously  used  techniques  fail  to 
apply  and  a new  set  of  techniques  must  be  developed.  The  asynchronous 
assumption  is  a useful  one  to  make  since  often  processes  have  execution 
times  which  depend  upon  the  data.  We  do  assume  that  the  times  are  known  to 
ho  within  some  upper  and  lower  bounds,  although  they  may  vary  with  tine 
within  these  bounds.  Examples  where  such  situations  arise  include  both 
cellular  arrays  of  devices,  where  eacli  device  runs  at  some  nonzero  rate, 
and  operating  systems,  where  each  process  is  given  a nonzero,  but  somowhat 
variable,  amount  of  time  to  act. 

<>ur  model,  to  be  formally  defined  in  the  next  section,  hvpothesi.  es  a 
system  of  n identically  structured  finite  state  machines  organized  as  a 
I incar  array.  Each  machine  is  allowed  to  communicate  with  other  machines 
in  its  own  neighborhood  (not  necessarily  just  with  its  adjacent  neighbors). 

The  time  is  measured  in  a relative  fashion,  with  one  step  elapsing  win  never 
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some  machine(s)  change  state.  A given  machine  is  said  to  become  active 
when  it  is  first  capable  of  a transition.  (Identity  transitions  are  not 
allowed,  so  a device  may  not  be  capable  of  another  state  change  immediately 
after  a transition  has  taken  place.)  Once  active,  the  machine  can  perform 
the  state  change  at  any  step.  However,  no  machine  can  remain  active,  with- 
out changing  state,  for  more  than  L)  steps.  The  delay,  D,  is  a nonnegative 
integral  value  which  gives  the  number  of  steps  any  processor  is  allowed  to 
remain  idle  prior  to  completing  a computational  step.  Hence,  when  D = 0, 
no  idle  steps  are  allowed,  each  processor  completes  execution  at  each  step 
and,  therefore,  the  system  is  synchronous.  When  D 0,  the  system  is 
asynchronous  and  the  processors  operate  at  a worst  case  rate  of  once  every 
D + 1 steps. 

Clearly,  because  the  rate  of  execution  is  a parameter,  the  model  to  be 
described  will  be  equally  capable  of  characterizing  synchronous , as  well  as 
asynchronous,  computation.  Indeed,  by  varying  D,  a single  system  can  be 
executed  using  either  policy.  This  facilitates  our  study  of  the  relationships 
between  synchronous  and  asynchronous  parallel  computation. 

Several  comments  are  in  order.  First,  note  that  no  assumption  is  made 
as  to  whether  or  not  the  relative  time  steps  are  of  equal  length.  Further- 
more, no  assumption  is  made  about  how  long  it  takes  for  a given  device  to 
change  state,  except  that  it  is  bounded.  Consequently,  we  are  allowing  the 
execution  time  of  a given  device  to  change  for  any  reason  whatsoever.  The 
same  transition  can  even  take  a different  number  of  steps  for  different 
devices  or  for  the  same  device  at  different  points  in  the  computation.  All 
that  is  required  is  that  it  be  bounded  by  D + 1 (D  is  fixed  for  any  given 
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computation).  This  point  of  view  is  motivated  by  an  interest  in  modelling 
parallel  circuits  as  well  as  operating  systems.  In  the  former  case,  the 
performance  of  the  device  may  be  influenced  by  physical  characterist  Lcs  of 
the  components.  In  the  latter  case,  a process  may  be  influenced  by  competi- 
tion with  other  processes  for  resources,  or  influenced  by  I/O  or  some 
other  exogenous  variables.  In  any  case,  if  the  delay  D cannot  be  chosen 
precisely  for  a given  system,  then  it  may  be  considered  to  be  a limit  beyond 
which  the  failure  of  a processor  to  execute  is  interpreted  as  a failure  of 
the  entire  system. 

A second  observation  is  that  the  assumption  of  "identically  structured" 
processes  is  not  overly  restrictive.  The  assumption  should  probably  be 
stated  as  "identically  structured  with  respect  to  the  interaction  among 
processes."  Hence,  the  interaction  of  multiple  instances  of  processes 
which  communicate  in  the  same  manner  is  being  studied.  Any  computation  not 
relevant  to  this  communication  is  allowed;  since  it  doesn't  influence  the 
overall  synchronization  behavior,  however,  it  can  be  ignored. 

Finally,  a word  of  warning  is  in  order  about  the  role  of  D.  D,  as 
it  is  used  in  the  sequel,  is  the  delay,  or  the  number  of  idle  steps  allowed 
before  a device  must  execute.  Consequently,  the  "firing  frequency"  ) or 
processors  which  are  always  active  will,  in  the  worst  case,  be  once  every 
L)  + 1.  steps.  Thus,  for  the  synchronous  case,  D = 0,  the  devices  must  fire 
at  each  step  and,  therefore,  no  idle  steps  are  allowed  for  active  processors. 

The  main  question  addressed  in  this  paper  is: 


How  do  linear  arrays  of  machines  operating  synchronously  compare 
with  linear  arrays  of  machines  operating  asynchronously  in  terms 
of  computational  and  synchronization  characteristics? 
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First  of  all  we  note  that  observed  globally,  a synchronous  array  has  precisely 
one  execution  sequence  (assuming,  as  we  do,  that  the  machines  are  determin- 
istic). By  contrast,  an  asynchronous  array  defines  a set  of  computations 
corresponding  to  the  differing  execution  rates  of  the  individual  machines. 
Obviously,  one  of  these  computations  is  a "synchronous"  computation  (in  the 
sense  that  each  machine  executes  without  any  delay).  Thus,  if  we  consider 
an  asynchronous  computation  to  be  well  behaved  if  the  computed  result  is 
independent  of  the  individual  execution  rates,  then  clearly,  anything  that 
can  be  computed  asynchronously  can  be  computed  synchronously.  Our  main 
question  thus  reduces  to:  are  asynchronous  arrays  weaker  than  synchronous 
arrays?  The  answer  depends  upon  whether  we  speak  of  synchronization  ability 
or  computational  ability. 

It  is  known  that  cellular  arrays  can  solve  a synchronization  problem 
known  as  the  "firing  squad  synchronization  problem,"  [7].  It  would  be  fool- 
ish to  expect  an  asynchronous  linear  array  to  solve  this  problem  (for  D > 0) 
since  the  soldiers  may  or  may  not  choose  to  "fire"  at  the  appointed  moment. 

But  suppose  that  we  required  all  soldiers  to  "fire"  within  an  interval  of 
size  D.  It  will  be  shown  that  this  simpler  problem  cannot  be  solved! 

Indeed,  a stronger  result  will  be  shown.  Hence,  with  respect  to  synchroniz- 
ing qualities,  the  asynchronous  linear  arrays  are  weaker  than  their  synchro- 
nous counterparts. 

By  contrast,  it  will  be  shown  that  for  language  recognition  problems, 
asynchronous  arrays  are  no  weaker  than  the  synchronous  linear  arrays.  This 
is  unexpected  since  in  the  synchronous  arrays  the  techniques  used  to  solve 
the  firing  squad  problem  are  central  to  the  solution  of  recognition  problems. 
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In  [10]  the  recognition  questions  were  analyzed  in  terms  of  the  time  required 
by  the  linear  array.  Hence,  it  is  not  only  of  interest  whether  a particular 
set  can  be  recognized,  but  the  time  required  in  comparison  to  the  synchronous 
case  is  also  relevant.  We  show  that  it  takes  at  most  3(D+1)  times  longer. 

This  last  result  uses  another  of  our  main  theorems.  Namely,  we  identify 
three  properties  of  linear  asynchronous  systems  — determinacy,  persistence, 
and  single  change  — and  show  that  these  are  sufficient  to  guarantee  that 
any  system  with  these  properties  operates  asynchronously  at  most  (D+l)  times 
slower  than  it  does  synchronously,  for  all  D.  With  this  result  we  obtain 
an  effective  strategy  for  solving  a problem  with  asynchronous  systems: 

First  find  a synchronous  system  for  the  task.  Establish  determinacy, 
persistence,  and  single  change,  and  then  invoke  the  above  theorem.  The 
validity  and  performance  are  thus  established. 

The  format  of  the  remainder  of  the  paper  is  as  follows.  Section  2 

gives  initial  definitions  and  illustrative  examples.  Section  3 shows  the 

impossibility  of  a linear  asynchronous  system  solving  the  firing  squad 

* 

synchronization  problem.  Section  4 proves  the  CR  Theorem  on  the  synchron- 
ous to  asynchronous  relationship.  Section  5 establishes  the  equivalence 
between  synchronous  and  asynchronous  recognition,  and  Section  6 poses  some 
open  problems. 


The  name  is  motivated  by  the  fact  that  this  theorem  has  a flavor  similar 
to  the  Church-Rosser  Theorem  of  Lambda  Calculus.  [1,3]. 


at.* 
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A rale  a — *■  0 models  changes  in  the  state  of  machines  in  terms  of 


their  own  states  and  the  states  of  neighboring  machines.  The  a ^ 8 


constraint  is  intuitively  natural  since  in  an  asynchronous  system  the  only 


detectable  action  is  a change  in  a state.  Remaining  in  a state  is  considered 


suppose  there  is  a production  a — *-8 


Given  a string  x 


such  that 


Thus,  successive 


modifications  to  some  initial  string  in  E will  represent  the  successive 


state  changes  in  the  linear  asynchronous  structure 


A particularly  simple  case  is  that  for  an  ordinary  array  of  n 


machines  where  any  machine's  next  state  depends  only  upon  its  current  state 


This  situation 


by  a grammar  in  which  | ot | =3.  The  middle  symbol  representing  the  machine 


where  a 


8 would  take  the  form:  a.a.a 


A precise  description  of  how  changes  can  occur  due  to  production 


Definition  2.2 


y provided  x i y and  x ^ y implies  there  exists  a 


production  a 


r 


I 


< 


(i)  3j,  1 < j < n such  that  xt_j+1  • • • •xi_j+k 


- a,  •••a.  •••oi,  , and 
1 j k 


(ii)  y.  . , = $ whenever  a ^ 8 for  s = 1,2, ••*,k. 

■'l-j+s  s s s 


Informally,  x^***xn  I — y^***yn  if  wherever  a change  takes  place  (x.^  ^ y^). 


then  there  is  some  production  matching  some  context  around  x^  (requirement 


(i))  and  that  each  change  implied  by  the  production  (a^  f Bg)  is  reflected 


in  the  result  (8  = y.  . ).  This  definition  is  quite  general,  allowing 

s i-j+s 


overlapping  application  of  productions. 

Example  2.1.  Let  G = <{1,2,3},  {12  — *■  13,  23  — + 21,  12  — *-  33}>. 
Then  the  following  are  allowed: 


123 

123 

123 

123 


133 

121 

333 

331 


Note  that  consistent  overlapping  is  allowed  and  that  in  the  last  two  cases, 
it  is  ambiguous  whether  12  — *•  13  is  applied,  since  it  is  subsumed  by 
12  ->  33. 

Example  2.2:  Let  G = <{1,2},  {12  — *■  21  }>  then 


112122  I — 121122 
112122  | — 112212 
112122  f—  121212 


Note  that  this  grammar  is  non-overlapping.  We  shall  have  more  to  say 
about  this  grammar  later. 


Definition  2.3.  Given  G = <£  , P>  and  x •••x  e T.  coordinate  i is 
— 1 n 


active  in  x,  •••x  provided  there  exists  a y,  •••y  € E such  that 

1 n r J1  n 


•*yn  and  i y±- 


.ii 


V"Xn  v 
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Notation:  Superscripts  will  be  used  to  denote  elements  of  a sequence. 

* * 

The  reflexive,  transitive  closure  of  | — is  denoted  I — . Hence  x I — y 

if  and  only  if  there  exist  x^* , x^,x^,*'*,x^  (q  > 0)  such  that 

0 . 1,2.  . q . . 

x = x | — x I — x I — • • 1 x = y.  We  write  x (— — y to  denote  a single 

application  of  production  p to  string  x,  while  x f-y  y means  that 

exactly  one  production  has  been  applied. 

We  are  now  ready  to  introduce  the  delay  property  into  our  asynchro- 
nous model. 

Definition  2.4.  Let  x^,x\***  e ln , G = <E,  P>  be  an  asynchronous 

0 12 

grammar  and  D > 0 be  an  integer.  The  sequence  x ,x  ,x  , •••  is  a 
D-computation  provided: 


(i)  Vj  - 0,  xJ 


j+1 

xJ  and 


( i i ) 'ji  i,j  such  that  x|  = x|+k  and  coordinate  i is  active  in 
xj+k  for  all  k = 0,1,2 , ••• ,D+1. 

Hence  a D-computation  is  a legal  sequence  of  state  transformations  such 
that  no  active  coordinate  remains  unchanged  for  D + 1 consecutive  steps. 

Kxample  2.3.  Let  G^  = <{1,2},  {12  — *•  21}>  then 

111222  I — 112122 
t—  121212 
I — 212121 
I—  221211 
I—  222111 

is  a 0-computation  for  G^,  while 

111222  | — 112122 
I — 121122 
I — 211212 
h—  212112 
I — 221121 
I — 221211 
(—  222111 
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Is  one  of  the  1-computations  for  on  111222. 

Note  that  two  active  but  idle  transitions  have  been  underscored. 

Evidently,  moves  all  2’s  to  the  left  and  all  l's  to  the 
right  in  both  the  O-computation  and  this  particular  1-computation.  ]s 
this  always  the  case  for  all  D and  all  D-computations  and  how  long  does 
it  take?  We  claim 

(i)  for  any  input  of  l's  and  2's,  G^  shifts 
all  2's  to  the  left  and  all  l's  to  the 
right  while  preserving  the  total  number  of  each 
for  any  O-computation, 

(ii)  for  all  D,  property  (i)  holds  as  well  for  all 
D-computations,  and 

(iii)  any  D-computation  for  G^  halts  in  time  less 
than  6(D+l)n  for  some  constant  6 > 1. 

The  claim  is  quite  intuitive  but  it  is  not  simple  to  prove  directly.  For 
example,  it  is  not  the  case  even  for  a O-computation,  that  once  a 2 begins 
moving  left,  it  continues  to  do  so  at  a rate  of  at  least  1/(D+1).  This 
is  because  a 2 can  "run  into"  a long  sequence  of  2's  and  be  blocked 
(since  the  rule  doesn't  apply)  for  a long  period  of  time.  In  short,  it  is 
quite  possible  for  a 2 to  exhipit  a "hurry-up  and  wait"  behavior. 

Assertions  (i)  - (iii)  are  in  fact  true,  but  to  prove  them  we  employ 
some  of  the  theory  developed  in  the  later  sections.  Our  purpose  in  proving 
the  assertions  now  is  to  underscore  the  proof  strategy  which  we  shall  employ. 
We  believe  that  it  is  an  effective  method  of  reducing  the  complexity  of 
this  type  of  proof  and  is,  therefore,  worthy  of  special  emphasis. 


The  argument  takes  the  following  form: 


(1)  verify  directly  that  the  grammar  computes  the  proper  result 
for  some  convenient  D-computat ion , 

(2)  find  the  time  t required  for  the  O-computation, 

(3)  show  that  the  grammar  (or  one  equivalent  to  it)  is  elementary ,T 

(4)  appeal  to  the  CR  theorem  (Section  4)  which  says  that  all 
D-computations  for  elementary  grammars  compute  the  same  res  lit 
and  the  time  is  less  than  (D+l)t. 

This  strategy  renders  the  proof  of  (i)  - (iii)  quite  painless  as  we  s lall 
now  see . 

For  correctness,  it  will  be  necessary  to  establish  that  the  output  is 
correct  only  for  a single  D-computation . Naturally,  we  choose  D to  sim- 
plify the  proof  and  this  is  often  the  O-computation  or  an  “-computation , 
i.e.  one  where  a specific  sequence  is  chosen  without  regard  for  how  long 
an  active  position  is  delayed.  For  the  problem  at  hand,  a specific 
1-computation  is  most  convenient.  In  particular,  for  any  string  x^*  of 
1 \s  and  2's,  we  choose  the  1-computation  with  the  property: 


0 

x 


i 

x 


x^  provided 


0 0 
x.  x 
1 J+l 


i+1  . , , 

x provided 


12  — *■  21  is  applied  to  all  pairs 
12  such  that  j is  even, 
all  active  coordinates  in  x^  actually  change 


(i  > 0) 


This  is  clearly  a 1-computation, 
odd  will  be  delayed  one  step;  the 


since  any  x 
computat ion 


0 

j 


0 

Xj+1 

is  then 


12  where  j if. 
synchronous  theieafter. 


See  Definition  2.8 
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The  single  change  property  is  fundamental  since  it  enables  a device 

to  change  independently  of  its  neighbors.  This  is  obviously  not  a property 

of  G^,  but  it  can  be  modified  to  be  single  change. 

Example  2.5.  Let  G ^ = <1,  P>  where 

£ = {1,2,A,B}  and 

P = {12  — * 1A 

1A  BA 

BA  -*•  31 

B1  — >•  21 

B2  -*  22}. 

The  intuition  here  is  that  the  intermediate  states,  A and  B,  implement 

an  "information  passing"  protocol  where  A means  "a  2 is  being  sent 

left  and  acknowledgement  of  receipt  is  requested"  and  B means  "the  2 

has  been  received  and  is  hereby  acknowledged."  Thus,  the  first  four 

productions  accomplish  the  12  to  21  interchange.  Production  five 

is  required  because  a 1 (placed  by  BA  —*  Bl)  could  have  already  been 

changed  into  a 2 due  to  the  asynchronous  execution. 

Grammar  G ^ computes  the  same  result  as  G^,  but  it  is  not 

true  that  its  time  satisfies  that  required  in  (iii)  above.  Indeed, 

2 

the  worst  case  behavior  of  G 2 is  6((D+l)n)  . The  difficulty  can 

be  observed  in  the  following  example. 

Example  2.6.  Given  G2,  a portion  of  a legal  1-computation  for  G^  is 

1 11111222222  I — 111111A22222 
l — 11111BA22222 
I — 11111B122222 
b—  11111B1A2222 
f—  11111BBA2222 


111112222221 
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The  underscored  transition  did  not  "fire"  and  is  now  "locked  out."  n the 
worst  case,  it  can  remain  so  until  a single  1 propagates  all  the  way  to 
the  right  and  the  B's  change  back  into  2's.  It  will  then  change  according 
to  the  last  production.  Apparently,  the  computational  requirements  are 
satisfied  for  , but  not  the  timing  requirements.  The  solution,  ol  course, 
is  to  add  a new  production. 


Example  2.7.  Let  = <{1,2,A,B},  P> 

P = {12  — 1A 
1A  — BA 
BA  — ► Bl 
B1  — 21 
B2  — 22 
BB  — * 2B } 


The  difficulty  with  is  that  it  is  not  persistent. 

Definition  2.6.  An  asynchronous  grammar  G = <2,  P>  is  persistent  if 

Vx,i,p  and  p';  x y with  x^  ^ y^  and 

x I — , y ' with  x . = y ' 

p l J i 

implies  3p"eP  such  that  y'  y"  with  y^  = y^. 

Persistence  prevents  an  active  transition  from  being  "locked  out"  by  the 
activity  of  its  neighbors.  Evidently,  G^  is  persistent.  G^  also  has 
the  other  property  required  of  an  elementary  grammar,  determinacy. 


Definition  2.7.  An  asynchronous  grammar  G = <2,  P>  is  determinate 

if  Vx,  i,  p and  p'  and  coordinate  i active  in  x,  if 

x | — y with  x.  4 y.  and 
p i i 

x h-ty'  with  xi  i y^ 


then  yt  = y*. 
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Determinacy  requires  that  there  be  a unique  next  state  for  any  state  change. 
Hote  that  G^  and  G^  are  also  determinate  asynchronous  grammars. 

Definition  2.8.  An  asynchronous  grammar  is  elementary  if  it  is 
single  change,  determinate,  and  persistent. 

Evidently  is  elementary,  so  we  can  appeal  to  the  CR  theorem,  thus 

establishing  (i)  - (iii)  for  G^.  Generally,  we  will  be  working  with 
single  change  grammars  such  as  G^  and  hence  we'd  be  finished.  Since 
G | is  not  single  change  a few  additional  observations  are  required. 

Clearly,  for  any  D-computat ion  of  G^  on  x,  each 

i . i+1 

x I — x 

can  be  replaced  by 

i , 1 2 3 , i+1 

x I — y I — y t — y f — x 

where  y ^ has  A's  in  all  positions  that  change  from  2 to  1 as 
x^  f — x^+\  y2  has  B's  to  the  left  of  the  A's  of  and  y^  has 

l's  in  A positions  y . This  is  a legal  4D-computation  for  G^. 

Hence  (i)  and  (ii)  must  be  true  for  G^.  Furthermore,  if  there  is  nc 
6 such  that  all  D-computat ions  of  G^  are  time  bounded  by  6(D+l)n, 
no  bound  could  exist  for  G^.  By  the  CR  theorem  it  does  exist  and 
hence  (iii)  is  established  for  G^. 

De f i n i t ion  2.9.  Let  G = <Z,  P>  be  an  asynchronous  grammar  and 

x'*  i A D-computation  x^,x^,***,  x™  is  said  to  halt  if  ^x  e Jn 

such  that  xm  I — x.  In  this  case  xm  is  called  the  result  of  the 


D-comput  at  ion . 
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3 . Synchronization  Capabilities 

In  this  section  we  demonstrate  that  asynchronous  linear  arrays  cannot 
synchronize  in  any  meaningful  way.  This  will  be  done  by  showing  that  a 
problem,  weaker  than  the  "firing  squad  synchronization  problem"  cannot  be 
solved.  As  noted  previously,  it  is  not  surprising  that  the  "standard 
version"  of  this  problem  cannot  be  solved,  but  it  is  not  even  possible  for 
the  two  "soldiers"  at  the  ends  to  "fire"  at  approximately  the  same  time. 

Definition  3.1.  An  asynchronous  grammar  G = <£,  P>  solves  the 
t, II  firing  squad  problem  provided 

| 

(a)  {g,c,q}  c E,  £ l and 

•ff 

(b)  i.  a — *■  6 e P implies  cx  i {q,c} 
ii.  for  any  D-computat ion  x^,  x\  • • • ,xm  that  halts 

0 n 

(where  x = gq  c).  It  - t I S f(n) 

8 c 

where 

t = min{k|x^  £ ^ } 

g J.  ' j 

t = min{k|xk  e J5  } 
c 1 n+2  ^ 

iii.  All  D-computat ions  can  be  extended  to  halting  D-computat ions. 

Informally,  the  conditions  can  be  viewed  as  follows:  g = "general," 
q = "null  state  soldier,"  c = "colonel,"  and  id  = the  set  of  "shoot" 
states;  g and  c mark  the  ends  of  the  array.  Condition  (bi)  requires  that 
the  process  be  initiated  by  the  "general,"  (bii)  requires  that  the  firing 
times  of  the  "general"  and  "colonel"  be  within  f(n)  of  each  other.  Note 

J 

that  the  definition  of  t and  t requires  that  only  the  first  "bullet" 

g c 

fired  by  the  general,  and  the  first  "bullet"  fired  by  the  colonel  are  of 
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interest.  Thus,  the  classic  solution  solves  the  *-0,0>  firing  squad 
problem,  i.e.  the  problem  is  solved  synchronously  with  the  general  and 
colonel  (and  all  other  soldiers)  firing  at  the  same  time. 

Theorem  3.1.  Suppose  that  the  asynchronous  grammar  G • *»I,  P- 
solves  the  <f,D>  firing  squad  problem  with  D > 0.  Then  there  exists 
a constant  6 > 0 such  that  f(n)  i fin. 

Proof . Let  G be  given  as  required  by  Definition  3.1,  and 

x°,x^,-**,xm  be  a halting  0-computation  where  x^  e {gq  c},  |x^|  = n+2  and 

t = min{lc  | xR  e si } 
g 1 

t = minfk I xR , „ e J } . 
c n+2 

For  each  peP,  writing  p as  aay  — v a'by  with  a 4 b indicates  the 

rightmost  changed  symbol  in  p.  Then  let  w = max{|a'| IpeP}. 

R P 

Similarly  writing  p as  aay  — *•  aby ' with  a f b indicates  the 

leftmost  change  in  p.  Let  w = max{ | y ' j | ptF)  . 

P 

Finally,  we  define  w as  w = tnaxfw  ,w  !. 

L R 

That  is,  w is  the  maximum  distance  over  which  a single  production 

application  can  cause  a change.  Therefore,  as  with  the  classical  firing 

squad  problem,  no  signal  can  travel  from  g to  c in  less  than  n/w 

steps;  thus  t ^ n/w.  Now,  suppose  that  f(n)  ^ n/2w  since  if  it  were 

not,  the  theorem  is  true.  But,  t > n/2w  since  otherwise  I t -t  I > n/2w. 

g eg 

Therefore,  both  t and  t are  at  least  n/2w. 
g c 

Suppose  t ^ t£  (the  other  case  is  analgous).  Without  loss  of 

generality,  let  n/4w  be  an  integer.  Define  k * t - n/4w.  Now,  there 

exists  a 1-computation 

0 1 k k+1  k+s 

X ,x  .•••,x  , y , * • • ,y 
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with  the  property  that  yj ’^2 ’ * " * *^n/2  ^re  at  each  step  (if  possible) 
and  yn/2+i“'yn+2  possible)  at  every  second  step. 

Then  the  following  are  true: 


...  k+1  k+1 

(1)  xj  *yj 

....  k+i  k+i 

(li>  xj  ’yi 


j = 1,2,  • • • ,-r-  - w and,  in  general 


j = 1,2, • ••  - iw. 


k+i  k+i 

Note  that  here  x.  refer  to  the  O-coraputation  and  y.  refer  to  the 
J 3 

1-computation.  For  i = 1,  by  definition  of  the  1-computation  all 

1 1 

transitions  on  y^  •••yn  take  place  exactly  as  for  the  O-computation. 

f "W 

Thus  (i)  follows.  For  i > 1 a simple  induction  proves  (ii). 

t t 

When  i = n/4w,  then  Xjg  = y_.8  on  machines  j = l,*»*,n/4.  Hence 

t = t ’ where 
8 8 

t 1 = min{ a I vf  e J?  } 

g 1 10 

and  the  "general"  fires  at  the  same  step  in  this  particular  1-computation 
as  in  the  O-computation.  Now,  analyzing  when  the  "colonel"  fires,  we 


claim  that  for  i = l,2,***,n/4w 

.....  k+i  2k+2i  , . 

(iii)  Xj  = y^  for  j = n/2  + 1 + iw,*-*,n+2. 

2 k+2 i 2 k+2 i 1 

of  course,  by  construction,  y^  1 = y ~ for  j = n/2  + 1,»««,  n + 2. 
Clearly,  (iii)  holds  also  by  induction.  Now,  when  i = n/4w  - 1,  then 


t -1  2t  -2 


for  j = n/2  + 1 + n/4  - w,***,n  + 2. 


t -1  2t  -2  . 

In  particular,  for  large  enough  n,  x^  = yn+®  . Define 


tg  - mln{  lly^+2  « 


t -i  2t  -2 

Since  t * t , x . / . Therefore  y ® 


r 


] 


i 

i 


r. 

■ < 

<: 

l i 


I 
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Consequently , 

t ' > 2t  - 2 . Thus 
c g 

|t’  - t | 

> t - 2 , so 

c g1 

g 

It'  - t'| 

> t - 2 . 

C g1 

g 

It  follows  that 


t (n)  n/2w  - 1 . LJ 

We  note  that  although  our  theorem  states  only  that  f(n)  £ 6n  we  have 
actually  proven  a somewhat  stronger  result,  namely  that  f (n)  £ n/2w  - 1. 
The  sharpness  of  this  result  is  particularly  noticeable  for  the  case  of 
an  ordinary  array  of  machines  with  | ot | = 3 mentioned  earlier.  In  this 
case  w = 1 so  our  result  becomes  f (n)  > n/2  - 1.  However,  such  a 
single  change  grammar  can  readily  be  devised  to  "find  the  middle"  of  an 
array.  Briefly,  this  can  be  accomplished  by  first  getting  the  array 
into  a condition  of  "alternating  1's  and  2's"  in  machines,  then 
applying  grammar  of  Example  2.7  to  move  all  2's  to  the  left  and  l’s 

to  the  right.  Finally,  the  21  boundary  indicates  the  "middle."  Now, 
firing  from  the  middle  outward  gives  a firing  for  which  f(n)  < nD/2. 

The  worst  case  being  that  one  half  fires  at  the  fastest  possible  rate  and 
the  other  half  fires  at  the  slowest  possible  rate.  Thus,  for  D = 1 we 
get  the  very  tight  lower  and  upper  bounds  of  n/2  - 1 < f(n)  S n/2. 
Although  it  seems  clear  that  D should  enter  the  lower  bound  in  a 
multiplicative  way,  so  as  to  give  fairly  tight  lower  and  upper  bounds 
for  any  D,  it  is  not  immediately  apparent  how  D could  be  appropriately 
introduced  into  the  proof. 

Theorem  3.1  can  be  viewed  as  being  sharp  in  still  another  sense. 

Given  any  D and  any  c > 0 there  is  an  asynchronous  grammar  that  solves 


i 


1 


f 


the  <f,D>  firing  squad  problem  with  f(n)  = en.  The  solution  is  just  to 

have  the  general  send  a signal  down  to  the  colonel.  This  signal  causes  each 

machine  to  fire  as  it  receives  it.  Since  one  can  clearly  send  a signal  from 

the  general  to  the  colonel  in  Sen  time  by  using  productions  of  the  form 
k k+1 

sq  — *•  s (s  = shoot  state,  q = quiescent  state)  for  k large,  it 

follows  that  one  can  solve  the  <f,D>  firing  squad  problem  with  f(n)  = en. 


■mbi 
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4.  The  CR  Theorem 

The  objective  of  this  section  is  to  prove  the  CR  theorem  which 
states  that  all  D-computations  for  a given  input  and  elementary  grammar 
yield  the  same  result  and  the  time  required  is  less  than  or  equal  to 
(D+l)  times  the  O-computation  time  for  that  input.  Several  preliminaries 
are  required  prior  to  the  statement  and  proof  of  the  theorem. 

This  entire  section  implicitly  refers  to  an  elementary  grammar 
G = <£,  P>. 

Definition  4.1.  Let  x e In  and  1 < k £ n.  Define  f^(x), 
a substitution  function  as 


fkoo 


( yabBs 


if  3aa 8 — *-  abB  e P such  that 
x = yaagz  and  |yaa|  = k, 


undefined  otherwise. 


Since  G is  elementary,  it  is  determinate  and  hence  f^Cx)  is  well 
defined.  We  next  establish  that  for  a single  transition,  the  order 
of  application  of  the  productions  is  immaterial.  Notat ionally , if 
f,  g,  h are  substitution  functions  for  x,  let  fgh(x)  denote  the  repeated 
function  application 
f (g  (h (x) ) ) . 

A A simple  consequence  of  the  definition  of  I — and  the  substitution 

functions  is  given  in  the  first  lemma. 

Lemma  4.1.  x I — y if  y = f . • • • f , (x)  where  f . (x j is 

d d d. 

1 q i 

defined,  1 £ 1 £ q. 


I 


Lemma  4.2. 


Let  x e L and  f (x) , 1 £ i £ q be  a set  of 
di 


substitution  functions  defined  for  x.  Then  for  any  permutation  tt  on 


i ■•■fd  (x)  = ‘“fd  (x)  = y e £ 

it.  tt  1 q 

1 q 


Proof . If  fr(x)  and  f„  (x)  are  defined,  then  r ^ s implies 


f f (x)  = f f (x)  = z e £ , 
r s s r 


since  G is  determinate  and  persistent.  A straightforward  induction  on 
the  number  of  interchanges  required  to  reorder  tt  to  l***q  completes 
the  proof.  LJ 


Suppose  fj  , H i S r is  the  set  of  all  substitution  functions 
i 


defined  for  x,  then  we  define  x t—  y as  y = f •••f,  (x) . We  a] so 

par  J J d,  d 

1 r 


note  that  x Hr  x'  if  x'  = f (x)  for  some  k,  1 < k < r. 

dk 


The  next  lemma,  illustrated  in  Figure  4.1a,  will  be  an  integral  part  of  the 
induction  of  the  subsequent  lemma. 

* 


Lemma  4.3.  If  xrl.x  Hr  y , x f — - x ' . y I — y ' imn  1 v x ' Hr  v ' . 
1 par  J par  v J 1 3 


pai 


Proof.  Let  f (x)  be  all  defined  substitution  functions  for  x, 
d . ’ 

l 


1 1 r.  From  the  definition  of  | — , and  Lemma  4.2 

par 


V’-fd  (x) 

1 r 


(1) 


and 


y = fd  (x) 
r 


with  renaming  if  necessary.  Let  f (y)  1 s i < q be  all  defined  functions 

ci 


for  y.  Then 

y'  - f •••f  f . (x). 


symbolic  graphic 


a LEMMA  4 3 


b LEMMA  4 4 W1 


c LEMMA  4 5 


d STATEMENT  OF  CR  THEOREM 


CLAIM^\  ^(KHKDH) 


* PROOF  OF  FART  (I)  OF  CR  THEOREM  f PROOF  OF  PART  (2)  OF  CR  THEOREM 


Figure  4.1  Graphic  representation  of  selected  lemmas  and  theorems 


% 


25 


By  persistence  f^  (x)  is  defined  for  all  i,  1 S i < r - 1, 

i r 

and  so  {f,  ,***,f,  } E (f  By  Lemma  4.1  then  we  can 


reorder  and  substitute  to  get 


y =f  • • *f  f • • *f  (x) 
e,  e d,  d 
1 si  r 


for  some  f where 
e . 

l 

e.  * ’ e ' c.  ’ ’ c ' trd.  * ’ d /• 

1 s 1 q 1 r-1 

Combining  (1)  and  (2)  and  using  the  definition  of  by  , it  follows  that 


x ty  y 


Lemma  4.3  is  generalized  in  Lemma  4.4.  The  lemma  and  proof  are  illustrated 
by  Figure  4.1b. 

n * * 

Lemma  4.4.  If  x € I , x Hr  y,  x ) — x , y | — v then  xf  y*  . 
1 J ’ par  y par  y 1 J 

Proof . We  use  induction  on  the  number  of  applications  of  f-y  in 

going  from  x to  y.  If  x = y the  result  is  trivial  and  when  the  length 


is  1,  the  resuLt  follows  from  Lemma  4.3.  Let 


x = w ry  w 


k . k+1 

y w by  w = y 


k '2.  Let 


i,  - i 
w I — w 
par 

By  Lemma  4.3, 


0 S i < k+1 


- i * - i+1 
w by  w 


0 s i s k 


Thus , 


- 1 * - k+1 

w by  w 


— 1 — k.+ 1 * 

and  since  x = w and  y'  = w it  follows  that  x'  — y'.  □ 


i 

i 
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Figure  4.1c  illustrates  the  next  lemma. 


Lemma  4.5.  If  x e En,  x I — y and  x I — v'  then  v I — v'  . 
J par  J J 1 1 


Proof.  Let  f,  (x)  be  all  defined  substitution  functions  for 
dA 


x,  1 < i < r. 
Then 


y = (x) 


J r 

for  some  j < r by  Lemma  4.1  and  (possibly)  renaming. 


But 


y* = fd/’,fd (x)  = fd/--fd  ,(y) 
1 r 1 j-1 


and  hence  y f-y  y'.  □ 


We  can  now  state  the  main  result  of  this  section. 

CR  Theorem.  Let  G = <E,  P>  be  an  elementary  asynchronous  grammar 


and  x e En.  Let  x = u^,u  ,u^,*‘*,u^  be  a halting  O-computation  of  G on 


x and  x = v° .v1 ,v2 , • • • .v™  be  any  halting  D-computat ion  of  G on  <.  Then 


(1) 

v 

i . 

u and 

(2) 

i , * 

u tT 

i (D+l) 

V 

0 < i < 

Note:  If  m < 2.(IH-1)  we  suppose  the  D-computat  ion  has  been  extended  such 

that  m < p < f.  (D+l)  implies  vP  = vm.  This  theorem  and  its  proof  ar.; 
illustrated  in  Figure  4. Id  - 4. If. 

Proof . 


(1)  We  prove  this  by  induction  on  i.  For  i = 0,  it  is  trivially 
true.  By  hypo the 
0-computation  u^ 


i * i 

true.  By  hypothesis  v Hj-  u for  i < k.  By  definition  of 


_ uj  1 0 £ j < i.  Define  v^  I — w 

par  par 


for  some  w.  By  Lemma  4.4,  w |-y  u + . But,  by  Lemma  4.5 

k+1  l*  tu  f k+1  l*  k+1 

v |-y  w.  Therefore  v |-y  u 

(2)  This  is  also  proved  by  induction  on  i.  For  i = 0,  the  result 

i * i(w-l) 

is  trivial.  By  induction  hypothesis,  u (-y  v , for  i < k, 

and  by  definition  of  0-computation  u''  u^+^  for  0 $ j < i . 

_ k (D+l ) , * k (D+l )+D+l  /tT  u , 

Claim,  dw  such  that  v I — w Hr  v . (We  prove  the  claim 

par  1 r 

, . \ d t ii  k+1  , * , , c k+1  , * k(D+l)+D+l 

in  a moment.)  By  Lemma  4.4  u hy  w and  therefore  u by  v 

To  see  that  the  claim  is  true,  let  f , (v^^D+^^)  be  all  defined  functions, 

di 

then 

, , , k (D+l) . 

w = f d * * * f d V > 

1 q 

by  definition  of  | — . For  1 < j < D + 1 define  f .,•••, I . to  be  all 
Par  cJ 

defined  substitution  functions  such  that  J 

k (D+l  )+j  , . , k ( D+l ) + j -1 . 

v J = f . • • *f  . (v  J ). 

cd  cJ 

1 qj 

But  by  persistence,  the  definition  of  D-computat ion  and  a reordering 

argument  similar  to  that  of  Lemma  4.2,  3f  .••*,f  such  that 
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k(D+l)+D+l  k(D+l) . 

v =f  •••f  f • • • f . (v  ) 

e.  e d,  d 
1 r 1 q 

and  the  claim  follows.  □ 

The  import  of  the  CR  Theorem  can  be  seen  in  the  following  corollaries. 


Corollary  4.1.  Let  G = <£,  P>  be  an  elementary  asynchronous  grammar 
and  x e l".  If  y £ l"  is  the  result  of  any  halting  D-computat ion  of  G 


on  x,  then  y is  the  result  of  all  halting  D-computations  of  G on  x. 

Corollary  4.2.  Let  G = <E,  P>  be  an  elementary  asynchronous  grammar 

and  x e £n.  If  a O-computation  of  G on  x halts  in  l steps,  then  any 

D-computat ion  of  G on  x halts  in  less  than  or  equal  to  Jl(D+l)  steps. 

It  is  important  to  note  that  the  requirements  of  determinacy  and 

persistence  are  necessary  in  the  sense  that  the  CR  Theorem  is  false  if  they 

are  eliminated.  This  is  clear  for  determinacy.  Grammar  G ^ from  Section  2 

2 

which  Is  not  persistent  and  which  executed  in  approximately  n rather 
than  n steps,  demonstrates  this  for  persistence.  We  shall  have  occasion 
to  use  these  two  corollaries  in  the  next  section  on  recognition  capabilities 
The  connection  between  asynchronous  computation  and  the  Church-Rosser 
property  has  been  observed  before  by  several  researchers  [6,  8].  The 
contribution  here  is  that  in  the  presence  of  bounded  delay  (D  < «>)  not  only 
do  asynchronous  computations  "behave  the  same"  but  they  operate  in  "about 
the  same  time."  In  particular  Corollary  4.2  is  new,  while  Corollary  4.1  for 
= °°  can  be  proved  fairly  directly  from  results  in  [6]  and  [8]. 
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') . Recognition  Properties  of  Linear  Asynchronous  Grammars. 

The  goal  of  this  section  is  to  argue  that  the  sets  recognizable 
by  cellular  1-dimensional  arrays  [10]  in  time  t can  also  be  recognized 
by  linear  asynchronous  grammars  in  time  3(D+l)t  for  all  delays  D > 0. 

Since  cellular  arrays  can  solve  a wide  class  of  recognition  problems 

in  a synchronous  and  efficient  manner,  we  can  conclude  that  these  problems 

can  also  be  performed  asynchronously  without  serious  time  degradation. 

The  overall  strategy  begins  by  noting  that  any  single  change,  determi- 
nate asynchronous  grammar  (of  which  cellular  arrays  are  a special  case)  can 
be  put  into  a normal  form  with  certain  properties.  The  next  step  is  to 
show  how  to  construct  a persistent  grammar  from  the  normal  form  grammar, 
sucli  that  both  grammars  produce  the  same  output  in  the  synchronous  case 
(0-computat ions) . Finally,  we  appeal  to  the  CR  Theorem  to  establish  that 
for  any  single  change,  determinate  asynchronous  grammar,  there  exists  an 
elementary  grammar  accepting  the  same  set  in  time  3(EH-l)t,  where  t is  the 
recognition  time  for  a 0-computation  of  the  original  grammar.  The  desired 
result  then  follows  as  a corollary  since  cellular  1-dimensional  arrays 
correspond  to  single  change,  determinate  asynchronous  grammars. 

To  simplify  the  exposition,  we  omit  two  details.  First,  we  omit  the 
construction  of  the  normal  form  and,  secondly,  we  ignor  the  details  involv- 
ing the  end-points  of  the  array.  For  convenience,  the  reader  can  suppose 
that  the  configurations  are  bounded  by  end  markers  and  appropriate  produc- 
tions exist  for  handling  the  markers.  The  general  case  is  unaffected  by 
this  assumption. 

Let  G = <E,  P>  be  any  single  change,  determinate  asynchronous 
grammar.  (Note  that  cellular  1-dimensional  arrays  satisfy  this  requirement.) 


I 


Definition  5.1.  A single  change,  determinate  asynchronous  grammar 

Gm  = <E,  P > is  a normal  form  for  G,  if  there  exists  a k such  that 
N N 

(i)  Va  6 e PN,  |a|  = k 


(ii)  a, 


•b,  c P„,  implies  a.  • 
k N 1 


‘Lk/2j  = bl' 


[_(k+l)/2j+l"*ak  = b|_(k+l)/2j+r"Dk 


(iii)  a — *■  8 e P <=*>  3p  e P such  that  at  I — 6- 
N p 

Informally,  requirement  (i)  states  that  all  productions  are  of 
the  same  size,  while  (ii)  guarantees  that  the  modification  is  to  the 
"middle"  term  in  the  production.  This  latter  requirement  implies,  of 
course,  that  k is  an  odd  integer.  Property  (iii)  requires  the  same 
behavior  from  the  two  production  sets  on  k length  strings.  It  can 
be  seen  that  cellular  1-dimensional  arrays  are  represented  by  normal 
form  grammars  with  k = 3.  Let  be  a normal  form  for  G and  let 

m = (k+l)/2  in  the  sequel. 

Lemma  5.1.  For  any  G,  there  exists  a G„t. 

N 

Proof . This  can  be  done  by  suitably  padding  out  productions  of 
the  original  grammar.  The  construction  is  omitted.  □ 

Lemma  5.2.  If  G is  single  change  and  determinate,  then  G^  is 
single  change  and  determinate. 

Proof . The  single  change  property  follows  directly  from  Definition 
5.1  (ii)  and  the  determinacy  property  follows  from  Definition  5.1  (iii).  □ 


Lemma  5,3.  Let  x^,***,xb  be  a O-computation  for  G and  x^  * y^.'",yb 

0 l t. 

be  a O-computation  for  G^  on  x . Then  x =y  0 S l < h. 
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Proof . By  Induction  on  2.  For  2 = 0 the  result  is  immediate. 

2,  2+1 

Suppose  it  is  true  for  all  i < 2.  If  x^  = x^  , no  p £ P applies  at 

; 2 

j in  x . If  p^  e P^  applies  at  j in  y , requirement  (iii)  is 

£-fi  a 

contradicted,  so  x.  = y.  If  x.  4 x.  , some  p e P applies  at  j 
’ 3 1 3 j J 

} 2 2 

in  x . Let  a = x . , • • *x . , , . Then  a I — B and  by  (iii ) , a — *■  B e P., 

j-m+1  j+m-1  p y N 

2 2+1  2+1 

implying  j active  in  y . Thus  = y . □ 

Having  found  a normal  form  for  G,  we  now  seek  to  construct  a 

new  grammar  G'  which  is  persistent.  We  first  require  a definition. 

Definition  5.2.  Let  G„  = <1,  P1T>  be  a single  change,  determinate 
- 1 N N 

asynchronous  grammar  in  normal  form.  The  completion  of  G„  is  a system 
G = <£,  P > such  that  P = PM  u {a  — ► a|  *B,  a — ► B e P..}. 

r»  n n M 1 > M 


Informally,  the  completion  of  G^  has  the  production  set  as  G^  with  all 
"idling"  productions  added.  Thus,  the  completion  is  not  an  asynchronous 
grammar.  This  is  no  problem  since  the  completion  will  be  transformed 
into  a legal  asynchronous  grammar  below. 

A few  comments  are  in  order  about  the  forthcoming  construction.  The 


goal  is  to  achieve  persistence.  The  technique  by  which  this  is  accomplished 
is  to  define  a protocol  that  enables  each  device  to  acquire  inputs  from 
its  neighbors.  The  protocol  is  basically  three  fold:  (1)  a device  announces 
its  intention  to  change  state.  At  this  point,  every  neighbor  that  depends 
upon  the  device's  current  value  for  their  next  state  change  must  now 
retrieve  the  input.  This  is  done  by  having  the  neighbors  announce  their 
intent  to  change  state.  When  all  the  neighbors  have  announced,  (2)  a 
device  is  allowed  to  perform  its  t ransition . After  the  device  and  its 
neighbors  have  changed  state,  (3)  they  acknowledge  that  fact  by  becoming 
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3^ 


quiescent . A new  cycle  is  then  ready  to  begin.  The  purpose,  there  ore, 
for  completing  the  grammar  is  to  enable  transitions  that  wouldn't  otherwise 
fire  to  receive  input,  even  though  no  new  state  change  will  result. 

Definition  5.3.  Given  = <£,  P^>  the  completion  of  an  asynchronous 
grammar,  define  the  alphabet  sets 


= { if  ] | a •••a  aa  •••a. 

b 1 1 m-1  m+1  k 


a • • -a  ,ba  , ■••a.eP  } 
1 m-1  m+1  k c 


<^l  V 


■a  .aa  • 
m-1  m+1 


■a  ,ba  • • *a,  eP  } 
m-1  m+1  Tc  c 


a* 


lb|[-]  e Za). 


Informally,  each  element  of  I3  is  a state  a device  enters  when  it  announces 
its  intent  to  change  from  state  a to  state  b.  Each  element  of  Z is 

ci 

a transition  state  it  enters  before  becoming  quiescent.  Each  element  of 
E * is  a quiescent  state.  We  next  define  the  productions  for  the  nev 
grammar  from  the  completion. 

Definition  5.4.  Given  the  completion  G *=  <E.  P > of  a normal 
c c 

form  grammar  define  the  production  sets 
P = 


; c , * * * c .a  c .•••c,  — *■  c •••c  , lc 

1 m-1  m m+1  k 1 m-1  b m+1 

m 


a • • - a • • >a, 

1 1 m k 


a • • *b  • • *a,  e P and 
1 m k c 


ct  e E 1 s i < k and  i ^ m}j 


a 


Tlio  P^  productions  accomplish  the  announcing  task.  Note  that  the  neighbors 

may  or  may  not  have  announced  when  a given  position  does  so.  The  P 

productions  perform  the  transition  and  they  require  that  the  neighbors 

have  either  performed  the  transition  or  announced.  The  P^  productions 

return  to  quiescent  state  where  the  neighbors  have  either  done  so  as 

well  or  at  worst,  they  have  performed  their  transition. 

Let  1'  = £a  u Y,  in  the  sequel. 

acZ  a 

Lemma  5.4.  G'  = <1  UE1,  P UP  UP  5 is  a single  change 
a t q " 

asynchronous  grammar. 

In  the  sequel  we  will  use  G'  for  this  grammar. 

Proof . Immediate,  by  cons truction . 0 

Lemma  5.5.  G'  is  determinate. 

Proof . Determinacy  is  vacuously  satisfied  since  no  two  productions 


have  the  same  left  hand  side  and  all  productions  change  only  the  middle 
symbol.  □ 


PRODUCTION  p APPLIES  AT  i IN  X 


PRODUCTION  p'  APPLIES  AT  l IN  X 


Figure  5.1. 


Production  application  with  overlap. 
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Lemma  5.6.  G'  is  persistent. 

Proof.  Let  x I — y for  some  p e P UP.  U P such  that  x.  4 y. 

P a t q i i 

and  x| — , y’  for  some  p'  e P UP  UP  such  that  x.  = y!.  Assume,  in 
p a t q li 

contradiction  to  the  persistence  requirement,  that  Vp"  e P UP  U P such 

3 L 

that  y'  I — ,,  y"  implies  yV  4 y..  Let  p = u •••u  •••u.  — *■  u •••u'**,u.  and 
p 11  1 m k 1 m K. 

o'  = v. •••v  •••v,  — ► v, • • *v ' • • • v,  . Thus  x — y implies 
1 lmk  lmk  !P 

i-m+1  i+m-1  1 k 

and  x |— , y'  implies 

x .•••x  , = v • • *v,  for  some  l with  i-m+l<£<i  + m-  l 

-m+1  i+m-1  1 k 

since  otherwise  p would  still  be  active  at  i in  x.  (See  Figure  5.1) 


Moreover , 

since 

xi  = yi* 

(*) 

X 

H* 

II 

Um  Vm+j 

(**) 

xe  = 

v = u 
m m-j 

Case  1.  (p  e P_).  By  construction,  p e P_  implies  u e E.  But  (*) 

3 3 III 

implies  v . t I,  so  pV  P . If  p'  e P , then  the  construction  implies 
v m+j  r t q 

v r E for  some  a,  but  (**)  implies  u . e E contradicting  p c P . 
m a r m-j  a a 

Therefore  pf  r P , 3nd  v1  e Z for  some  3.  But  by  construction,  a 
r 3 m J 

production  w •••w  • • ‘w.  — * w exists  such  th3t  1 s z < k, 

lmk  lmk 

7.  4 m— j,  u = w , and  w . = v'  e E . Moreover,  w'  = u'.  Thus,  the 
z z m-j  m mm 

assumption  is  false  if  p e P^. 

Case  2.  (p  e P.)*  By  construction  p c P.  implies  u e E for 
t t m 

some  a.  But  (*)  implies  v , , e E , so  p'^P.  If  p'rP,  then  the 

r m+j  r q r a 

construction  implies  v c E,  but  (**)  implies  u . e E contradicting 

m m-j 
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p e P . Hence,  p'  c P , and  v'  £ E for  some  a.  But  by  construction, 
v t t m a J 

a production  w,  • • *w  • • *w,  — *•  w,  • • *w' * • -w,  exists  such  that  1 ^ z 5 k, 
v 1 m k 1 m k 

z 4 m-i,  u = w and  w . = v'  e E . Moreover,  w'  = u'.  Thus  the 
J z z m-j  m a ’mm 

assumption  is  false,  if  p e P . 

Case  3.  (p  £ P ) . By  construction,  p e P implies  u e E for 
q q ma 

some  a.  But  (*)  implies  v , , £ E , so  p'^P.  If  p'eP,  then  the 

r m+i  a’  r a t 

construction  implies  v e Ea , for  some  a,  but  (**)  implies  u . t la 

m ‘ m-J 

contradicting  p £ P^ . Hence,  p'  e P . An  argument  similar  to  the  above 

guarantees  a production,  so  the  assumption  is  false  if  p e P . □ 

Lemma  3.7.  The  sequence  ,x^  , • • • ,x^  is  a O-computation  sequence 

for  G'  on  x^  iff  x^  = y^,y^ , • • • ,y^  is  a O-computation  sequence  for  G 

I' 

0 j i 3i  n , , 

on  x , and  y = x 0 < i < h. 

Proof . Clearly,  by  construction 


3i  n 

x £ E 


i = 0,1 ,2 , ■ • • ,3h 


X £ {a^EZa}n  1 = °»! **  * * »3(h-l) 

x3i+2  c {aUsEa}n  i = 0,1 , • • • ,3 (h-1) 


0 1 3h 


0 * 3i 


since  x ,x  , • • • ,x  is  a O-computation.  Suppose  x f—  x satisfies  the 


lemma  for  i = 0,1, •••,£.  To  prove  the  if  part,  let  p £ P apply  at  j 

X 3 

in  x^8-,  then  x'  | — x^>+^  and  x?'+^  = [f]  for  some  a,b  e E.  Each  of 

P1  i b 

the  m-1  neighbors  on  either  side  is  in  a state  chosen  from  U Ea  as  well. 

ae  E 

Hence  there  is  p„  e P_  such  that  x^P+1  | — x~^+^  and  x^f+i  = f^l. 

1 c P2  J 

Now,  the  m-1  neighbors  on  either  side  are  in  a state  chosen  from 

. . r _ , , . 3i+2  , 3(£+l) 

C/„E  . Hence  some  p.  e P applies  and  x I — x such  that 

aeE  a r3  q p^ 


^3(2+1)  _ But,  by  construction,  these  three  transitions  imply  there 


, , 32.  3(2+1)  3(1+1)  , , . 

exist  p e P such  that  x | — x with  x,  = b.  If  a f b 

c P j 


. n J 3(2+1)  2 T.  , , 

then  p c P.,  and  so  x.  = y . . If  a = b , no  production  in 

N j J 


Z Z -Hi  st 

P,,  applies  (p  was  added  by  completion)  and  y = y^  at  the  (£+1) 


step  of  the  O-computation  of  on  x . 


To  prove  the  only  if  part,  suppose  in  the  O-computation  of  on  x , 


f+1  . I £ 2+1 

y . If  p is  active  at  'j  in  y , p £ Pc»  If  y = yj  , a 


completion  production  has  been  added.  In  either  case,  according  to  the 


32  32+1 

construction  a production  p,  e P exists  such  that  x I — x with 

la  p1 


3f  + l a 2 2+1 

x.  = [, ] and  y.  = a and  y.  = b.  It  is  easilv  observed  that 
J b •'j 


, , J . , , 32+1  . 32+2  . 3(2+1) 

other  productions  exist  such  that  x I — x I — x and 


„ - yM.  □ 
J J 


Theorem  5.1.  Given  a single  change,  determinate  asynchronous  grammar 


G = ■ Y. , P>  there  exists  an  elementary  grammar  G'  = <E',P’>  such  that 
Vn  and  VD  > 0,  if  x^  £ Zn  and  x^,***,x  ^ is  halting  0-computat ion 


of  G on  x , then  31^  such  that 

0 ^2  0 
(i)  x , • • • ,x  is  a halting  D-computation  for  G'  on  x with 


h2  hl 
x = x , and 


(ii)  h_  3 (D+l )h , . 

L 1 


Proof . Porm  G^,  the  normal  form  for  G.  Complete  the  normal 


form  and  construct  G'  . The  O-computation  for  these  grammars  yield 


the  same  output  with  G'  operating  at  most  3 times  slower  than  G.  By 


the  CR  theorem  of  the  previous  section,  G'  satisfies  (i)  and  (ii) 


since  it  is  elementary,  by  Lemmas  5.4,  5.5,  and  5.6.  [] 


R 


I 


Corollary  5.1.  A set  recognized  by  a cellular  1-dimensional  array 
in  time  t is  recognized  by  some  elementary  grammar,  asynchronously,  in 
time  less  than  or  equal  to  3(D+l)t,  for  all  D > 0. 

Proof . Let  6:  E x £ x £ — ► E be  the  transition  function  for 
the  cellular  1-dimensional  array  [10].  Then  define  G = <1,  P>  where 
a a a — > a b„a  e P iff  6 (a  ,a  ,a  ) = b0,  with  a„  ^ b0, 


I* 
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<•_. (Conclusions 

What  we  have  done  in  this  paper  is  to  introduce  the  notion  of  bounded 
delay  asynchronism  and  to  study  some  of  its  properties  in  terms  of  a very 
special  structure  which  we  called  linear  asynchronous  structures.  We  have 
shown  three  main  results.  First,  that  these  structures  cannot  be  synchro- 
nized well;  the  "gap  between  firings"  is  a function  of  the  number  of  elements 
in  the  structure.  Second,  that  under  suitable  hypotheses  (elementary  gram- 
mars) the  systems  compute  unique  values,  and  are  not  much  "slower"  than 
synchronous  structures,  and  finally  that  these  systems  are  computationally 
as  powerful  as  synchronous  systems. 

Several  natural  questions  arise.  The  first  is:  What  happens  for 

more  complicated  structures?  Natural  extensions  would  be  to  higher  dimen- 
sional uniform  structures  and  to  tree  and  graph  structures  in  which  each 
node  had  the  same  in  and  out  degree.  It  would  seem  that  similar  results 
could  be  obtained. 

The  second  question  arises  from  our  CR  Theorem  in  Section  4.  We  have 
already  shown  in  Section  4 how  there  is  an  intimate  connection  between  Church- 


Rosser  systems  and  linear  asynchronous  structures.  But  our  CR  Theorem  says 
something  more  than  the  usual  Church-Rosser  type  result  by  introducing  timing, 
or  number  of  steps,  comparisons  between  the  "shortest"  and  "longest"  paths  to 
the  unique  result.  The  question,  then,  is  how  or  when  can  such  timing  results 
be  obtained  for  other  types  of  Church-Rosser  theorems?  Clearly,  they  do  not 
hold  in  general  since  some  Church-Rosser  systems  can  have  "unbounded  delay." 


Also,  one  could  have  bounded  delay  but  some  sort  of  looping  behavior  that 
could  give  rise  to  no  tight  bound  existing  for  timing.  Nevertheless,  it  would 
be  interesting  to  characterize,  for  general  Church-Rosser  systems,  when 


various  types  of  bounds  on  timing  hold. 


; 
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